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Abstract. We examine the one-humped map at the period-doubling 
transition to chaos, and ask whether its long-term memory is stack-like 
(last-in, first-out) or queue-like (first-in, first-out). We show that it can be 
recognized by a real-time automaton with one queue, or two stacks, and 
give several new grammatical characterizations of it. We argue that its 
memory has a queue-like character, since a single stack does not suffice. 
We also show that its dynamical zeta function, generating function and 
growth function are transcendental. The same results hold for any period- 
multiplying cascade. We suggest that transcendentality might be a sign 
of dynamical phase transitions in other systems as well. 



1 Introduction 

One of the basic distinctions in physics and dynamical systems is between finite- 
state processes, characterized by short-range, exponentially decaying correla- 
tions, and processes with an infinite amount of memory, causing long-range cor- 
relations such as power laws. 

But we can draw a finer distinction based on what kind of long-term memory 
a system has. Two of the basic data structures studied in computer science are 
the stack, in which the most recent symbols pushed on must be read and removed 
before older symbols can be read (last-in, first-out), and the queue, in which 
symbols are read in the same order they are entered (first-in, first-out). These 
are two fundamentally different ways for a system to depend on its history. 

For instance, a stack can recognize the set of palindromes, such as abccba, 
while a queue prefers to repeat things in the same order, as in abcabc. Both have 
long-range correlations; the difference is that they are nested in the first case 
and cross each other in the second, as shown in figure 1. While a stack pushes 
and pops at the same end, a queue pushes symbols on one end (the right, say) 
and pops them off the other. 

In this paper, we examine the symbolic dynamics of the one-humped map at 
the period-doubling transition to chaos. We show that it can be recognized in 
real time by an automaton with access to two stacks, or one queue. We review 
and extend previous descriptions of it as an indexed context-free language, and 
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Fig. 1. Both stacks and queues have long-term memory — but stacks have nested 
correlations, while queues have crossing ones. 

also show that it can be generated by a breadth-first context-free grammar. Since 
it is more easily recognized with queues than with stacks, we argue that the 
system's memory has a queue-like character. 

In addition, we show that various functions associated with the system, in- 
cluding its dynamical zeta function and the generating and growth functions of 
its symbolic dynamics, are transcendental. We show that this is true for period- 
multiplying cascades and general, and speculate as to whether these character- 
istics might be common to other kinds of dynamical phase transitions. 

A preliminary version of this work appeared in [14] . 

2 The period-doubling cascade 

Consider the family of functions on the unit interval [0, 1] 



As n increases, undergoes a series of bifurcations forming stable periodic or- 
bits of period 1,2,4,... Each new orbit is formed by a period-doubling transition 
from the one before it, which then becomes unstable. 

At any given stage in this process, there is a stable periodic orbit of period 
2™ which attracts almost all initial points, and unstable orbits with periods 2 fe 
for all k < n. All these bifurcations accumulate at the transition to chaos, at 
which all these orbits are unstable and an aperiodic attractor (of period 2°°) 
appears. 

If we label the two halves of the unit interval and 1 for x < 1/2 and x > 1/2 
respectively, any point x can be assigned a sequence or 'itinerary' in the following 
way: 



represents the symbolic dynamics [10] of F^. 

As we go through this cascade of bifurcations, the symbolic dynamics change 
accordingly. For ^ < 1/2, all points have images in the left half of the interval: 



F^(x) = 4/j,x(l - x) 




Then the language or set of sequences 



L = {a(x)\xe [0,1]} 



therefore the only possible itinerary is 000 • • • = 0* , where the * operator indi- 
cates or more repetitions. As fi increases, we get an unstable fixed point at 
and a stable one which migrates into the right half; the symbolic dynamics is 
now 0*1*, indicating that once we cross over into the right half- interval, we can 
never come back. 

At the next bifurcation, the stable fixed point splits into a period-2 orbit 
and becomes unstable. As fi increases further, one of these two points crosses 
back onto the left side, giving the orbit an itinerary of (01)*; the symbolic dy- 
namics now becomes 0*1* (01)*. As this process continues, we get a sequence of 
languages: 

L = 0*w* 

Li — 0*WqwIw2 



where L n is the language when the period-2" orbit is stable. The itineraries of the 
periodic orbits of period 2™ are wo = 1, w\ = 10, W2 = 1011, = 1011 1010, 
and so on. Note that w n+ \ consists of two copies of w n with the last symbol 
complemented; equivalently, w n +i — R(w n ) where R is the renormalization 

R : -» 11, 1 -> 10 

As we approach the transition to chaos, the stable periodic points approach an 
attractor whose itinerary is the fixed point of R, the Morse sequence !D M = 
1011 1010 1011 1011 • • •. The symbolic dynamics is then 

oo = W W 1 W 2 W 3 ' ' ' = U n=0 L n 

Note that unlike some other studies of this system [6, 12], we are including the 
transient part of the dynamics, rather than just the set {w n } of periodic orbits 
or the attractor w^. 

Each L n is a regular language [11] in that it is generated by a finite-state 
Markov process, or recognized by a finite-state machine. However, is not 
regular, since it has an infinite number of inequivalent states. This was first 
pointed out by Grassberger [9] who gave an infinite-state transition graph for 
Loo similar to that shown in figure 2. This means that the system has an infinite 
amount of memory. But what kind? 

3 Languages and automata for 
3.1 One stack can't do it 

Our approach to automata and grammars will be somewhat informal. We rec- 
ommend [11] as an introduction. 
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Fig. 2. The infinite-state transition diagram for L^. We can start from either bold 
node. 

A push-down automaton (PDA) is a machine with a single stack and a finite- 
state control. PDAs recognize the context-free languages (CFLs), so-called be- 
cause they can be generated by a grammar in which symbols can be transformed 
into strings regardless of the adjacent symbols. The canonical example is the 
Dyck language L = {e, (), (()), ()(), . . .} of properly nested words of parentheses 
(here e is the empty word). A PDA can recognize L by pushing a symbol on the 
stack when it sees a '(', popping when it sees a ')', starting and ending with an 
empty stack, and refusing to pop an empty stack. The corresponding grammar 



generates L from the initial symbol 7, where the comma indicates multiple op- 
tions for the production rule. Here 7 is a variable used during the derivation 
process, but all final words must consist only of the terminals '(' and ')'. 

This grammar is unambiguous, in that each word in L has a unique derivation. 
Since L has a grammar of this kind, it is called an unambiguous CFL. 

One useful property of context-free languages is the Pumping Lemma [11]: 
any sufficiently long word in L can be written aft-fSfi such that af3 k jS k /j, is also 
in L for all k > 0. For the Dyck language, (3 and 8 are simply the left and right 
parentheses. 

Crutchfield and Young [6] point out that the Pumping Lemma can be used 
to show that several languages related to L^, including {w n } and the set of 
substrings of Woo, are not CFLs. On the other hand, obeys the Pumping 
Lemma since initial substrings of a word can be repeated and considered part of 
the transient. In order to show that L^, is not a CFL, we need to use Ogden's 
Lemma [15], in which we can mark particular symbols in a word and demand 
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that they be among those pumped. Since the periodicity of a word in must 
increase from left to right, symbols near the end of w n for large n can't be 
repeated, and L M is not a CFL. The actual proof of this is given in the Appendix. 

A more powerful kind of machine is a stack automaton [11], which can delve 
into the stack in a read-only mode. A one-way stack automaton (1NSA) is one 
which can only read its input from left to right. Lakdawala [12] pointed out that 
the set {w n } cannot be recognized by a 1NSA, since another pumping lemma 
of Ogden [16] shows that any such language cannot be more than quadratically 
sparse, i.e. there must be sequences of words w n of length \w n \ < bn 2 for some 
b. Since the set of periodic orbits {w n } is exponentially sparse, it cannot be 
recognized by a 1NSA. Extending this proof to seems fairly difficult. 

3.2 A stack of stacks can 

An indexed context-free language [1] is a context-free language using indexed 
symbols, which have attached indices which are strings in another alphabet. 
When a symbol produces a string of symbols through a context-free production 
rule, the indices are copied to all the 'daughter' symbols. In addition, symbols 
can be pushed or popped from the index, which acts like a stack. The final 
product must consist of symbols with empty indices. 

Indexed CFLs are recognized by nested stack automata [2] . These are consid- 
erably more powerful than PDAs; in addition to reading the top symbol, they 
can move down into the stack in a read-only mode. They can also create a new 
sub-stack within a stack, and so on to any number of levels, but cannot move 
up past a sub-stack until it is empty. In essence, they have a stack of stacks. 

We need concern ourselves here just with indexed languages where the index 
alphabet consists of one symbol, say x. Then we can think of the indices as 
non- negative integers, by using the notation Ai for A indexed by the string x l . 

Consider the following indexed context-free grammar, in which 1„ produces 
w n , the itinerary of the period-2 n orbit: 

In * In— lOn— 1 
On y In— iTn— 1 

Since the set {w n } of periodic orbits is formed by iterating the renormalization 
R, we can also think of it as a OL-system [11]. Indexed grammars of this kind 
for {w n } were given in [6] and [12]. 

To produce blocks of w„'s with increasing n we add the productions 

/ -> 01, B Q 

B n * lni ln-^n: -^n+1 

This grammar produces L x . But to describe the symbolic dynamics completely, 
we really need to generate INIT^oo), the set of all initial substrings of L x , that 
don't necessarily end in a complete copy of some w n . 

Since the indexed context-free languages form an full abstract family of lan- 
guages, INIT^oc) is automatically also indexed context-free [11]. By adding a 



few new symbols and productions, we can give the following grammar for it 
explicitly: 
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Here the last block symbol l' n , of which each derivation tree has one running 
down its right edge, generates either w n or an incomplete initial substring of it. 
It is easy to show that this grammar is unambiguous. 

3.3 One queue can too 

Cherubini et al. [4] define a real-time queue automaton as a finite-state automa- 
ton with access to a finite number of queues, in which it is allowed to make 
exactly one computation step per input symbol. The educated reader may know 
that, given unlimited time, a machine with a queue can simulate a universal 
Turing machine with quadratic slowdown. However, restricting it to real time 
creates an interesting class of languages. 

In this section, we will show that is in the class QA l7 recognizable by a 
real-time deterministic queue automaton with one queue. The idea is simply to 
store the current periodic orbit w n on the queue, and apply the renormalization 
when necessary to expand it. 

Our automaton will have five states: zero, repeat, expand, expand2, and 
reject. Our queue alphabet will have two unmarked and two marked symbols, 
{0, 1, 0', 1'}. We will used the marked symbols 0' and 1' to mark when the queue 
comes to the end of some w n . A word will be accepted as long as the automaton 
does not enter the reject state during the input process. 

In our notation, we will use a and b as input symbols and q as the leftmost 
symbol on the queue. We will use R for a version of the renormalization which 
carries marks to the second symbol: — ► 11, 1 — ► 10, 0' — > 11', 1' — ► 10'. We 
will ignore marks when testing equations such as a = q or ab = R(q). Then the 
automaton's program is given in table 1. 

We start out in the zero state with an empty queue, and read 0s until we 
reach the first 1, whereupon we initialize the queue with wq = 1'. 

The repeat state simply matches the input symbol to the leftmost symbol 
on the queue, pops it, and pushes it on the queue's right end, so that we cycle 
through the word w n stored in the queue. Since w n +i consists of two copies of 
w n with the last symbol complemented, repeat enters the expand state if the 
last (marked) symbol of w n does not match the input. If there is a mismatch 
before the end of w n , there has been a mistake, and the machine rejects. 



zero: 

— If a — 0, then stay in zero 

— If a = 1, then push 1' on the queue and go to repeat 
repeat: 

— If a — q, then pop q, push q, and stay in repeat 

— If a / g and g is marked, then pop q, push g, and go to expand 

— If a / q and g is unmarked, then reject 

expand: 

— If a is the first symbol of -R(g) , then go to expand2 

— If a is not the first symbol of -R(g), then reject 

expand2: 

— If a is the second symbol of R(q) and g is unmarked, then pop g, push 
R{q) and stay in expand 

— If a is the second symbol of -R(g) and g is marked, then pop g, push R(q) , 
and go to repeat 

— If a is not the second symbol of -R(g) and g is marked, then pop g, push 
R{q), and stay in expand 

— If a is not the second symbol of -R(g) and g is unmarked, then reject 

reject: 

— Stay in reject 

Table 1. A five-state, one-queue machine that recognizes L^. 



The expand and expand2 states are a little tricky. When we entered them 
from repeat, we have w n on the stack, whose last symbol differed from the input. 
We then spend the next 2™ +1 input symbols expanding w n to w n+ \. If the input 
matches this to the end, that means that we are in a block of w Il+ i's, so we keep 
w n+ i on the stack and switch back to repeat. If, on the other hand, the last 
symbol of the input differs from uVi+i, then the w n+ i we saw was actually just 
the first half of a u>„ + 2, and we should stay in expand. This w n+2 is either the 
first one of a block of u>„ +2 's or the first half of a w n+ s 7 and so on. 

Finally, the re j ect state simply remembers that something went wrong along 
the way. We give an example of the machine's operation in table 2. 

Just as push-down automata correspond to context-free grammars, queue 
automata correspond to breadth-first context-free grammars [3] . Their production 
rules are of the form v — * a/3, where v is a variable, a is a string of terminals 
and (3 is a string of variables. But rather than inserting a(i where v appears in a 
derived string, we insert a at w's position and append /3 to the end of the string: 

xvy — > xotyft 

If we always apply the production rule to the leftmost (least recently produced) 
variable, the derivation mimics the operation of a queue automaton, where the 
string of variables represents the queue and the growing string of terminals 
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Table 2. An example of the queue automaton's operation. Here we accept the initial 
string 1 1011 1011 1010 = w w 2 w 3 . 



represents the part of the input that has been read so far. Then any language L 
recognized by a one-queue automaton can be written 

L = h(L BC F n R) 

where Lbcf is breadth-first context-free, R is regular, and h is a a homomorphism 
that maps symbols to strings [4]. 

The idea is that £bcf H R is a language in a decorated alphabet, whose 
variables are the queue symbols and whose terminals include both the input 
symbol and the machine's internal state. Then R enforces the proper transitions 
in the finite-state control, Lbcf keeps track of the queue, and h removes the 
decorations and leaves just the input. If the automaton operates in real time, with 
one step per input symbol, then h is alphabetic or length-preserving, mapping 
symbols to single symbols. 

We can easily transform the above queue automaton into a BCF grammar. 
In addition to marks at the end of the w n , our decorated alphabet will include 
subscripts z, r and e to indicate whether the machine was in the zero, repeat 
or expand states when the input was read, and subscripts r — > e and e — > r to 
indicate transitions. Then with our variables in bold, we have: 

i-»o z i, i;i' 

-» r 0, l e l e ll 

1 -> l r l, l e e 10 

0' -» 0;,0', lU e 0', l e lU, ll', leOUl' 
1' -> i;,l', 0U e l', leOUrlO', lelelO' 



Then let R be the regular language 



^ - Oz (C^r + A r ^ e A e (A e A e ) A e ^ r ) j4 r ) 

where = {0£., lj.} and so on. This ensures that transitions from r to e and 
back only occur when reading a marked symbol, and checks that the transitions 
are consistent. Finally, let h erase all marks and subscripts. 

This grammar produces growing strings of terminals with a growing queue. 
The reader can easily add additional subscripts indicating that no new variables 
will be generated, and add a condition to R that makes sure the input word ends 
with these terminals. This corresponds to simulating our deterministic queue 
automaton with a non-deterministic one, which guesses when to stop pushing 
symbols and accepts with an empty queue. 

3.4 Two stacks 

In this section, we will show that Loo can be recognized by a deterministic real- 
time automaton with access to two stacks. Equivalently, L M can be written 

Loo = h(L 1 nL 2 ) 

where L\ and L 2 are context-free languages. With no time restriction, two stacks 
are sufficient to simulate a universal Turing machine; restricting our automaton 
real-time means that h is again an alphabetic homomorphism. 

We use the fact that the tu n 's are palindromes except for their last symbol. 
If we again mark the symbols on which the queue cycles through to the last 
symbol of some w n , any word in can be written 

0*a' voa'iVia' 2 V2 • • • 
where each a! i is 0' or 1', and each Vi is a palindrome of 0s and Is. 
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Fig. 3. Using two context-free languages to repeat and expand from Vi to Vi+i. One 
language takes care of pairs where i is even, the other where i is odd. 

The reader can check that Vi+i is a copy of Vi if a[ = a' i+1 , and a renormalized 
copy R(vi) followed by 1 if Using the fact that Vi — Vi where v denotes 



v in reverse, if we define the context-free language 



L = 



a 1 v\a 2 v 2 



v 2 = vi if a[ = a' 2 

v 2 = R(vi) 1 if a[ 7^ a' 2 



and 



L x = INIT(0*L*) 
L 2 = INIT(0*1'L*) 

then L± and L 2 enforce this relationship between Vi and Vi + \ for even and odd i 
respectively. As an added bonus, L 2 makes sure a' — 1' and vq = e. If h removes 
marks as before, then = h(L\ fl L 2 ). 

The deterministic two-stack automaton this corresponds to is almost identical 
to the queue automaton of the previous section. We store the current Vi on one 
of the stacks. To repeat or expand we pop it off one stack and push it on the 
other, with or without renormalization. This reverses the order of the symbols, 
but that doesn't matter since the Vi are palindromes. We double the number 
of states in order to keep track of which stack we're currently reading from, 
and double them again to remember the last symbol a! i of the previous w n ; we 
compare this to the input a' i+1 and switch from repeat to expand if they don't 
match, and so on. The reader can easily fill in the details. 



4 Transcendentality 
4.1 The growth function 

There are several functions we can associate with a language. One is the growth 
function N(l), the number of allowed words of length I. For regular languages, 
the leading behavior of N is of the form l k X l where k is an integer and A is 
algebraic. Unambiguous context-free languages have the same form, except that 
k may be rational [8] . For instance, the growth function of the Dyck language is 

the sequence of Catalan numbers, N(2l) = j^j^i^j ~ l~ 3 / 2 4 l /^/tt. 

Throughout this section, we will refer to INIT(ioo) simply as L^. To calcu- 
late N(l) for Loo, consider two of its subsets, 

L' = INIT(1*(10)*(1011)*---) 

i.e. the set of words in Lqc that don't start with 0, and 

L" = INIT((10)*(1011)* • • •) 

i.e. the set of words in that don't start with or 11. Let N'(l) and N"(l) be 
the growth functions of L' and L" respectively. 



Then note that any word in -Lqo consists of a word in L f preceded by none 
or more Os, and similarly any word in L' consists of a word in L" preceded by 
none or more Is. That is, = 0*L' and V = 1*L". Then 

l 

N(l) = J2N'(j) 
3=0 
i 

N'(l) = J2 N "U)- 1 iorl>0 (1) 

3=0 

(The —1 here prevents double-counting of the word l', since both 1 and e are 
members of L". Obviously, we can ignore it for large I.) In addition, the rcnor- 
malization R gives a one-to-one correspondence between words in L 1 and words 
in L" that are twice as long, so 

N'(l) = N"(2l) (2) 

Now if we approximate (1) as an integral, combine it with (2), and differentiate 
both sides, we get 

dN'(l)/dl = N'(l/2) 

Solving this with series yields 

00 in 

n'(i) cx y - , 1W9 

n=0 

which clearly converges. Its leading behavior for large I is 

N'(l) CX e A (^lf + B\nl lnln( + ClnZ _ ^A\nl ^Blnlni 

where 

1 



2 In 2 

B = — 

In 2 

„ 1 1 In In 2 

C = — I 1 

2 In 2 In 2 

To obtain N(l) we just integrate N'(l), which changes C to C + 1 . 

This leading behavior of l A ln ' grows faster than any power law, but slower 
than an exponential. 



4.2 The generating function 

Another function we can associate with a language is its generating function 

oo 

G{z) = Y,N{l)z l 

1=0 

G(z) is rational for regular languages, and algebraic for unambiguous context- 
free languages [5]. 

As before, consider L' and L", and let their generating functions be G and 
G" respectively. Equation (1) now becomes 

2, 



G(z) = (l + z + z 2 + ...)G'(z) 



1 - z 



G'(z) = (1 + z + z 1 + • • •) G"(z) -(z + z 2 + ■■■) = G "[ Z l z Z (3) 

In addition, every word in L" of even length is a renormalizcd word in V ', and 
every word in L" of odd length is a renormalizcd word in L' followed by 1 . This 
gives 

G"(z) = (l + z)G'{z 2 ) 
which when combined with (3) gives 



G'(z) = ^G'(z 2 )- Y 



z 



This can be simplified somewhat by writing G'{z) — + 1. Then 

g( Z ) = Z +J—g(z 2 ) (4) 
I — Z 

Using the fact that g(0) = since G'(0) = 1, this gives 

9(z) = z + ^-(z 2 + -L_ {z ± + . .))) 

1 2 1 1 4 

= Z+ — Z + —— Z 

Since N(l) grows less than exponentially, G(z) will converge for \z\ < 1 and 
diverge at \z\ — 1. We're particularly interested in the nature of that divergence. 
If z = 1 — e where e is small, (4) becomes 

/(e) w e-\f(2e) 

where /(e) = g(l — e). As e goes to zero, this gives a leading behavior of 

/(e)oce Aln£ e- s 

with A = 1/(2 In 2) and 5 = 1/2. Then G'{1 - e) and G(l - e) diverge with the 
same form, except that B = 3/2 and B — 5/2 respectively. 



There are two ways to see that G is transcendental. The first is that the poles 
of an algebraic function must diverge polynomially, and e Alne diverges faster 
than any polynomial. The other is that there is a pole at z = e %e whenever 9 is a 
rational fraction of 2ir, since these points are periodic under the transformation 
z — > z 1 . Thus the unit circle has a dense set of poles and forms an essential 
boundary, outside which G cannot be analytically continued. 



4.3 The dynamical zeta function 

Many time-averaged properties of dynamical systems, such as Lyapunov expo- 
nents, repeller lifetimes, etc. can be calculated from a generating function for 
the set of periodic orbits of the system. We can write this as a sum over the set 
of prime cycles {p}; since eacn one can start at p different points, and can be 
repeated any number of times, we have 



G(z) = ]T 



pz p 



{p} 

This can be written as a logarithmic derivative of a dynamical zeta function ( 
[7, 17]: 

G(*)=*|]og C 

where 

c 1 (*) = II( 1 -* p ) 

M 

For the one-humped map at the transition to chaos, we have one prime cycle of 
period 2 k for each k, and another of period 1, so 

C\z) = {l-z)g{z) 

where 

g(z) = (l-z)(l-z 2 )(l-z i )--- = (l-z)g(z 2 ) 

giving ( a pole at e l9 whenever 9 is a dyadic rational fraction of 2-7T. Letting 
g(\ — e) = /(e) gives a similar recurrence as before, 

/(e) « e/(2e) (5) 

giving 

C(l -e) oce Aln£ e B 

with, once again, A = 1/(2 In 2) and B = —3/2. This divergence for the zeta 
function was pointed out in [13]. 



5 Conclusion 



We have studied the period-doubling transition to chaos from a computational 
and analytical point of view. Since one queue suffices to recognize its symbolic 
dynamics in real time, but two stacks are necessary, we argue that the system's 
long-term memory has a first-in, first-out character. We have also shown that 
its growth, generating, and dynamical zeta functions have an interesting tran- 
scendental form. 

To what extent are these things common to other dynamical phase tran- 
sitions? For period-multiplying cascades, the transcendental forms remain the 
same. If we have a series of orbits of period r k for some r other than 2, the 
recurrence (5) becomes 

/(e) = e/(rc) 

giving exactly the same divergence for (, except that now A = 1/(2 In r). The 
growth and generating functions behave similarly. 

As far as grammatical structure is concerned, the indexed context-free gram- 
mar and queue automaton given above can clearly be adapted to a wide variety 
of renormalizations. But how general is our construction of a two-stack automa- 
ton? The Morse sequence 0110 1001 1001 0110 • • • generated by -> 01, 1 -» 10 
is its complement in reverse, so a two-stack machine can easily be constructed 
for it. On the other hand, the Fibonacci sequence abaab aba abaab ■ ■ ■ generated 
by a — > ab, b — > a, which occurs in the quasiperiodic behavior of coupled oscilla- 
tors, can be recognized with a queue [4] but does not seem to have any obvious 
palindromic or stack-like structure. We leave this as an open question. 
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A Appendix: is not context-free 

In this Appendix, show that and INIT^oc) arc not context-free, and prove 
a number of enjoyable combinatorial results about these languages on the way. 
We will use another useful characterization of the w n , 

w a = 1 
wi = 10 

w n — %-iW^_ 2 for n > 2 

Iterating this gives us a useful equation: 

w n — w i wf_ 1 wfwf +1 ■ ■ ■ w 2 n _ 2 for < i < n (6) 

Then we can define as 

Loo = {w^Wi 2 w i3 ■ ■ ■ w im : h < i 2 < ■ ■ ■ < i m , < m} 

We will call initial and final substrings heads and tails respectively, and call 
j3 a proper head of a if (3 = where 7 is non-empty, i.e. \/3\ < \a\. Then to 
describe words in INIT^oo), we use the following lemma: 



Lemma 1. If is a proper head of w n , then it can be decomposed into Wi of 
strictly decreasing size, i.e. — w^Wi 2 ■ ■ ■ Wi m where n > i\ > i 2 > ■ ■ ■ > i m for 
some m > 0. Therefore, INIT(Loo) can be written 

INITXLoo) = {w h w l2 w i3 ■■■w im : 

i 1 < i 2 < ■ ■ ■ < i k > i k+1 > • • • > i m , < k < m} (7) 

with decompositions consisting of a non- decreasing part followed by a strictly 
decreasing part. 

Proof. The proof is by induction on n. The first half of w n is w n -i, and the 
second half of w n only differs from tu n _i on its last symbol. Therefore, if is a 
proper head of w n , we have two cases: if \/3\ < 2 n , then f3 is a proper head of 
tu n _i, and if \[3\ > 2 n_1 , then [3 = w n _i(3' where (3' is a proper head of io n _i. 
Since the only proper head of wq is the empty string, the induction is complete. 

To get equation (7), note that any word in INIT(Loo) can be written Wi 1 • • • Wi k /3 
where i\ <■ ■■ <ik and [3 is a (possibly empty) proper head of Wi k+1 . □ 

We will also find the following useful: 

Lemma 2. If (3 is a proper tail of w n , then it can be decomposed into Wi of 
non- decreasing size, i.e. (3 — w^w^ ■ ■ ■ Wi m where i\ < i 2 < • • • < i m < n for 
some m > 0. 

Proof. We use a induction similar to that in the previous lemma. Let (3 be a 
proper tail of w n . Recall that w n = w n -\w\_ 2 - If \(3\ < 2 n ~ 2 , then [3 is a proper 
tail of w n - 2 . If 2 n ~ 2 < \[3\ < 2™- 1 , then [3 = l3'w n - 2 where (3' is a proper tail of 
w n - 2 . Finally, if 2™" 1 < \(3\ < 2™, then = 0'w 2 n _ 2 where 0' is a proper tail of 
w n -i- 

Thus every tail has a non-decreasing decomposition, in which, moreover, each 
Wi is repeated at most twice. □ 

We now prove an important lemma. Note that if a is in INIT(Loo), so are 
all its substrings. Call a head or tail a w-head or w-tail if it consists of some Wi. 

Lemma 3. The decomposition of words in INIT^^) given in equation (7) is 
unique. 

Proof. Call a decomposition a — w^w^w^ ■ ■ ■ Wi m legal if it is first non-decreasing 
and then strictly decreasing, i\ < ■ ■ ■ < ik > ik+i > • • • > i m for some k < m. 
We wish to show that no word has more than one legal decomposition. To prove 
this by contradiction, let a be a word of minimal length with two distinct legal 
decompositions, 

a = w tl w t2 ■••w im 
= w jl w h ---w jn 

Since a has minimal length, these must differ on the first block, i.e. i\ ^ j\. 
Otherwise, a = Wi 1 where is a shorter word with two legal decompositions. 



We assume without loss of generality that i\ < ji and apply equation (6) to 
the second decomposition, obtaining 

a = w^Wiz ■ ■ ■ Wi m 

Then if a = we have 

[3 = w i2 ■ ■ ■ w im 

= w n~i w l ■ ■ ■ W n-2W J2 ■ ■ ■ w Jn 

Both these decompositions are legal. If i\ < ?2, we have — 1 < «2, and they differ 
on the first block. If ii < i\ then either 12 < ii — 1 and they differ on the first 
block, or 12 = i\ — 1. But in this case a's first decomposition is decreasing all the 
way, so 13 < 12 and they differ on the second block. In every case, (i is a shorter 
word with two legal decompositions, again contradicting our assumption that a 
has minimal length. The proof is completed by noting that the decomposition 
of the empty word is unique! □ 

This uniqueness has the following corollary. 

Lemma 4. Each of the Wi in a 's legal decomposition is the largest w-head of the 
tail of a starting with it, and the largest w-tail of the head of a ending with it. 
Furthermore, any w n that occurs as a subword in a must be contained in some 
Wi in a 's decomposition; it cannot cross the boundaries between the Wi . 

Proof. To show that is the largest w-head of a, suppose that w n is a w-head 
of a where n > i\. Then w n — Wi 1 ■ ■ -w^P for some I, where [3 is a head of Wi l+1 . 
Then using lemma 1 on (3 gives a legal decomposition for w n other than w n itself 
(note that the decomposition given by equation (6) is not legal). Therefore, any 
w-head of a must be contained in w il . By induction, each of the w, is the largest 
w-head of the part of a starting with it. 

On the other end, we wish to show that w im is the largest w-tail of a. If w„ 
is a w-tail of a where n > i m , then w n = (3w il ■ ■ ■ w im for some I, where (3 is a 
tail of Wi l _ 1 . Using lemma 2 on (3 gives a legal decomposition for w„ other than 
itself, and we again have a contradiction. By induction, each Wi is the largest 
w-tail of the part of a ending with it. 

Finally, suppose some subword w n of a is not contained in any of the w^'s 
of a's decomposition, and that it crosses one or more boundaries between the 
Wi. Then w n = f3wi t ■ ■ ■ Wi p j where (3 is a tail of Wi l _ 1 and 7 is a head of w ip+1 . 
Using lemmas 1 and 2 again gives a decomposition for w n other than itself. □ 

This means that given a € INIT^oo), we can find a's legal decomposition by 
proceeding from the left (or the right) and taking the largest w-head (or w-tail) 
at each step. Note that the uniqueness of this decomposition is also implicit in 
our calculation of the generating function for INIT^oo) in section 4.2. 

Next, we prove that the w n don't have square heads, cubic tails, or quartic 
subwords. 



Lemma 5. No w n has a head of the form a 2 , a tail of the form a(3a(3a, or a 
subword of the form a 4 , where \a\ > 0. 



Proof. We prove the first statement first. It is true for wq- Suppose w n = aaS, 
and assume that no smaller w n can be written this way. We must have \a\ > 2 n ~ 2 
since otherwise w n -\ has aa for a head. Therefore, the middle of w n falls inside 
the second a, dividing it into two subwords a = [3j. Then w n = fjjfljS, and since 
w n 's first and second halves are identical except for the last symbol, we have 
w n -\ = f3-f[3 — 7<5, where v denotes a word v with the last symbol complemented. 

We now have two cases. If I7I > \(3\, write 7 = (3C, and w n -i = (3f3(P has 
a square head. If \[3\ > \-f\, write [3 = 7^ and w n — 7C77C The middle of w n 
must divide the second 7 into 7 = n9, so w„-2 = r)9(n — 9n9(. If \9\ > \r]\, write 
9 = rjn and w„_2 = rjriK^r}; if \rj\ > \9\, write rj = 9k and w„_2 = 99n9l^. In 
either case, ro„_2 has a square head. By contradiction, no w n can have a square 
head. 

To show that no w n can have a tail of the form af3af3a where \a\ > 0, 
recall that w n with its last symbol removed is a palindrome. Suppose that w n = 
Saf3af3a and a — 7a where a is a's last symbol. Then w n = Sjaf3^a(3-fa = 
^ffSarfparfSa, and w n has a square head (j(3a) 2 where v denotes a word v written 
in reverse. 

Finally, suppose w n has a quartic subword, w n = (3a 4 -f, and no smaller word 
does. Then the middle of w n must divide one of the en's into a — £77, where 
without loss of generality £ is non-empty. Then w n = /3£ • r/( ■ rj( ■ r/C ■ rjj where 
the middle of w n falls on one of the four -'s. The first two give w n -i a square 
head ri(ri(, and the last two give w n -i a tail C^C^C- By contradiction, no w n can 
have a subword of the form a 4 . □ 

We now prove a result which will be instrumental in showing that words in 
INIT(Loo) can only be pumped in a limited way. Call a a w-cycle if it is a cyclic 
permutation of a non-zero power of some w n , i.e. a = e or (3ujPj for some p > 
where 7/3 = w n . 

Lemma6. If a, a 2 and a 3 are in INIT^^), then a is a w-cycle. 
Proof. Let a and a 2 have legal decompositions 



a 2 = Wi t Wi 2 ■ ■ ■ Wi k ■ ■ ■ Wi m ■ Wi 2 • • • Wi k ■ ■ ■ Wi m 
= w h w h ■■■w jl ■■■w jn 

where h < i 2 < • • • < ik > • • • > i m and ji < j 2 < ■ ■ ■ < ji > ■ ■ ■ > j n . 

First, note that i p — j p for p < k, so that ■■■Wi k _ 1 starts the legal 
decomposition of a 2 . If not, the first Wj p for which this is not true, or the 
head of it that overlaps the first a, has a decomposition starting Wi p Wi p+1 ■ ■ ■ 
where i v < i P +i- This would contradict lemma 1, which shows that the legal 
decomposition of a head of Wj p is strictly decreasing. 



Similarly, i p = j p _ m+n for p > k, so that Wi k+1 ■ ■ ■ w im concludes the legal 
decomposition of a 2 . Otherwise, the last wj p for which this is not true, or its 
tail that overlaps the second a, has a decomposition ending • • ■Wi p _ 1 Wi p where 
ip-i > i p , whereas by lemma 2 its decomposition must be non-decreasing. 

Therefore, we write 

« 2 = W H ■ ■ ■ w i k -l P w ik+l ' ' ' w t m 

where 

P = w ik ■ ■ ■ w lm ■ w ix ■ ■ ■ w ik 
= Wj k ■ ■ ■ Wj k _ m+n 

where Wj k ■ ■ ■ Wj k _ m+n is /3's legal decomposition. We will now explore a series 
of cases regarding the structure of f3. 

Case I: j k = i k and j k - m +n ^ h- We must have j k+1 > j k = i k . Otherwise, 
/3's decomposition starts out decreasing, and its length is a sum of decreasing 
powers of 2 totalling less than 2 • 2 lk , which contradicts the fact that it starts 
and ends with Wi k . Since i k +\ < ik, we have jk+i > ik+i, and so Wj k+1 must 
overlap the second a to avoid having a legal (decreasing) decomposition other 
than itself. Similarly, jk-m+n > ik by lemma 4, and Wj k _ m+n must overlap with 
the first a to avoid having a non-decreasing decomposition. 

Since this means Wj k+1 and Wj k _ m+n overlap, they must coincide. Then (3 = 
Wi k Wj k+1 and a is a cyclic permutation of Wj k+1 = Wi k+1 ■ ■ ■ u>i m ■ Wi ■ ■ ■ Wi k . 

Case II: j k ^ i k and j k - m +n ^ ik- By lemma 4 we have j k > i k , so w jk 
must overlap with the second a to avoid having a decreasing decomposition. As 
in Case I, Wj k _ m+n must overlap with the first a to avoid having a non-decreasing 
decomposition. Therefore Wj k and Wj k _ m+n coincide, and (i — Wj k . 

However, this leads to a contradiction when we consider a 3 . (For instance, if 
a = 110101, then [3 = 10111010 = w 3 , and a and a 2 arc in INIT^), but a 3 
is not.) By the same argument as for a 2 , the legal decomposition of a 3 begins 
with Wi ± ■ ■ ■ Wi k _ 1 and ends with Wi k+1 ■ ■ ■ Wi m , so we can write 

a 3 = w h ■ ■ ■ w ik-l P $ w ik + l ■ ■ ■ w i m 

where 

S = w lk+l ■ ■ ■ w im ■ w ix ■ ■ ■ w ik 

and P = Wi k S. Since <5 is a tail of (3 — Wj k , by lemma 2 its decomposition is 
non-decreasing. On the other hand, the legal decomposition of a 3 must include 
(3 = Wj k and then decrease, since otherwise the block starting with (3, or the next 
one, would run past the end of S. Since S's decomposition is both decreasing and 
non-decreasing, it must be a single w s for some s; but this gives (3 — Wj k — w ik w s , 
a contradiction. 

Case III: j k ^ i k and j k - m + n = ik- As in Case II, j k > i k and Wj k must 
overlap the second a. Let q be the rightmost index such that j q ^ i q + m -n- 
Then j q > i q + m - n by lemma 4, so Wj q must overlap the first a, and Wj k and 



w jq coincide. Moreover, the sequence j q+1 = i q+m _ n+1 , . . .,j k _ m+n = i k is non- 
decreasing, but it must also be decreasing since j k > ik- Thus q = k — m + n — 1 
and this sequence consists only of j k _ m+n = ik, so P — w j k w i k an d a is a cyclic 
permutation of Wj k = Wi k ■ ■ ■ w m ■ u>\ ■ ■ ■ u>i k _ 1 . 

Case IV: j k = i k and j k - m +n = ik- As in Case I, j k+1 > i k+1 and w jk+1 
must overlap the second a. As in the previous case, let q be the rightmost index 
such that j q 7^ i q+m - n . Then Wj q must overlap the first a, so Wj k+1 and Wj q 
coincide. Moreover, the sequence j q+1 = i q + m - n +i, ■ ■ ■ , jk-m+n — ik is non- 
decreasing, so these must all be equal to i k since j k = i k . Thus (i = Wi k Wj q wf k 
wherep = k — m+n — q > 1. If p > 1, we must have j q = i k for this decomposition 
to be legal, (5 = w[ +2 , and a is a cyclic permutation of w[ +1 . 

If p — 1, we need to examine a 3 to eliminate the possibility that j3 = 
Wi k Wj q Wi k where j q > i k . (For instance, if a = 011101, then (i = 101011 10 = 
w\W2Wi, a and a 2 are in INITfXoo), but a 3 is not.) This would give us 

a 3 = w h ■ ■ ■ w ik _ 1 7 w ik+1 ■ ■ ■ w im 

where 7 = Wi k Wj q Wi k Wj q Wi kl or 7 = WiWjWiWjWi for short. We will show that 
no such word with j > i is in INIT(Loo). 

Suppose the largest it;- head of 7 is some w larger than Wi. By lemma 4 the 
Wi and Wj cannot be cut by tu's right boundary, so w k must be WiWj, WiWjWi, 
WiWjWiWj, or WiWjWiWjWi. The first two of these give w a legal decomposition, 
while the last two give it a square head and violate lemma 5. So w = Wi and 7's 
legal decomposition starts with Wi. 

Next, suppose the largest w-head of WjWiWjWi is some w larger than Wj. The 
same arguments eliminate all possibilities except w — WjWiWj; but if j > i, the 
length of this cannot be a power of 2. So 7's legal decomposition starts with 
WiWj. Since the remainder of 7 is already legally decomposed into WiWjWi, its 
entire decomposition must be WiWjWiWjWi; but for this to be legal, we must 
have i = j. 

Thus if p = 1, then j q = ik, (i = w 3 k and a is a cyclic permutation of wf k . 
This completes the proof for all cases. □ 

We are now in a position (finally!) to prove the following: 

Theorem 7. Neither L m nor INIT(L oo) are context-free. 

Proof. Ogden's Lemma [15] states that for any context-free language L, there is 
a constant k such that if k or more symbols in a word £ e L are designated as 
'distinguished,' then £ = ap'ySfi where: 

— 7 contains at least one distinguished symbol 

— Either f3 or 5 contains at least one distinguished symbol 

— If (i (rcsp. 5) contains distinguished symbols, then a (resp. fi) does also 

— (3^5 contains at most k distinguished symbols 

— aP p jS p iJ, e L for all p > 



We will refer to a0 p j8 p /j, as £ p . 

Suppose INIT(Loo) is context-free. Then let £ be w n with the last symbol 
removed, i.e. £ = ffl*tii n _iid n _ 2 ■ ■ ■ w , with n chosen such that 2™ > k. Mark the 
last k symbols of £ as distinguished; these are all to the right of the last w n . 

Since 7 contains a distinguished symbol, 8 cither consists entirely of distin- 
guished symbols or is empty. In the first case, since 8 can be pumped, it must be 
a w-cycle by lemma 6, and since \8\ < k < 2 n , it must be a cyclic permutation 
of some Wi with i < n. For p > 5, then, £ p contains a quartic subword wf. By 
lemma 5 this must appear in £ p 's legal decomposition, which must therefore be 
non-decreasing at this point. But this is a contradiction, since with 4 copies of 
w n to the left of 8, either contains w n or both a and 7 do, and in either case 
w n occurs more than once in £ p to the left of wf. 

If 8 is empty, then both a and (3 must contain distinguished symbols, in 
which case consists entirely of them. Then must be a cyclic permutation of 
some Wi with i < n, and the same argument goes through. In both cases, £ p <^ L 
for p > 5, and INIT(Loo) is not context-free. 

The context-free languages are closed under INIT [11], so since INIT(L 00 ) is 
not context-free, L M cannot be either. □ 

We are confident that the same techniques can be applied to show that 
and INIT(Xoo) are not recognizable by one-way stack automata. The difficulty 
is to put an upper bound on the size of the 0j and tpj (in the notation of [16]) 
as we did here for 8 and 0. 
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